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Abstract. The construction of perfect hash functions is a well-studied 
topic. In this paper, this concept is generalized with the following defi- 
nition. We say that a family of functions from [n] to [k] is a (5-balanced 
(n, fc)-family of perfect hash functions if for every S C [n], \S\ = k, the 
number of functions that are 1-1 on S is between T/S and ST for some 
constant T > 0. The standard definition of a family of perfect hash func- 
tions requires that there will be at least one function that is 1-1 on 5", for 
each S of size k. In the new notion of balanced families, we require the 
number of 1-1 functions to be almost the same (taking S to be close to 
1) for every such S. Our main result is that for any constant (5 > 1, a 5- 
balanced (n, fc)-family of perfect hash functions of size 2'^'*'°^'°^'') logn 
can be constructed in time 2'^''''°^'°^ '°'n log n. Using the technique of 
color-coding we can apply our explicit constructions to devise approxi- 
mation algorithms for various counting problems in graphs. In particular, 
we exhibit a deterministic polynomial time algorithm for approximating 
both the number of simple paths of length k and the number of simple 
cycles of size k for any k < O( iog logi'ogn ) ^ graph with n vertices. The 
approximation is up to any fixed desirable relative error. 
Key words: approximate counting of subgraphs, color-coding, perfect 
hashing. 

1 Introduction 

This paper deals with explicit constructions of balanced families of perfect hash 
functions. The topic of perfect hash functions has been widely studied under 
the more general framework of fc-restriction problems (see, e.g., [3], [13]). These 
problems have an existential nature of requiring a set of conditions to hold at 
least once for any choice of k elements out of the problem domain. We gener- 
alize the definition of perfect hash functions, and introduce a new, simple, and 
yet useful notion which we call balanced families of perfect hash functions. The 
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purpose of our new definition is to incorporate more structure into the construc- 
tions. Our explicit constructions together with the method of color-coding from 
[5] are applied for problems of approximating the number of times that some 
fixed subgraph appears within a large graph. We focus on counting simple paths 
and simple cycles. Recently, the method of color-coding has found interesting ap- 
phcations in computational biology ([17], [18], [19], [12]), specifically in detecting 
signaling pathways within protein interaction. This problem is formalized using 
an undirected edge-weighted graph, where the task is to find a minimum weight 
path of length k. The application of our results in this case is for approximating 
deterministically the number of minimum weight paths of length k. 

Perfect Hash Functions. An (n, fc)-family of perfect hash functions is a 

family of functions from [n] to [k] such that for every S C [n], j^l = k, there 
exists a function in the family that is 1-1 on S. There is an extensive literature 
dealing with explicit constructions of perfect hash functions. The construction 
described in [5] (following [11] and [16]) is of size 2'^('^Mogn. The best known 
explicit construction is of size e'^fc'^'^'"^'') logn, which closely matches the known 
lower bound of /2(e'^ \ogn/Vk) [15]. 

Finding and Counting Paths and Cycles. The foundations for the graph 

algorithms presented in this paper have been laid in [5]. Two main randomized 
algorithms are presented there, as follows. A simple directed or undirected path 
of length — 1 in a graph G = {V, E) that contains such a path can be found 
in 2'^(*'^[£'| expected time in the directed case and in 2'^*^'^^|i/| expected time in 
the undirected case. A simple directed or undirected cycle of size in a graph 
G = {y,E) that contains such a cycle can be found in either 2'^('^)|F||£J| or 
20(/c)|y|a) expected time, where to < 2.376 is the exponent of matrix multiplica- 
tion. The derandomization of these algorithms incur an extra log factor. As 
for the case of even cycles, it is shown in [20] that for every fixed k>2, there is 
an 0(|V^|^) algorithm for finding a simple cycle of size 2k in an imdirected graph. 
Improved algorithms for detecting given length cycles have been presented in [6] 
and [21]. An interesting result from [6], related to the questions addressed in 
the present paper, is an OdT^j"^) algorithm for counting the number of cycles of 
size at most 7. Flum and Grohe proved that the problem of counting exactly the 
number of paths and cycles of length k in both directed and undirected graphs, 
parameterized by k, is #VK[l]-complcte [10]. Their result implies that most likely 
there is no f{k) ■ n'^-algorithm for counting the precise number of paths or cy- 
cles of length A; in a graph of size n for any computable function / : IN — > IN 
and constant c. This suggests the problem of approximating these quantities. 
Arvind and Raman obtained a randomized fixed-parameter tractable algorithm 
to approximately count the number of copies of a fixed subgraph with bounded 
treewidth within a large graph [7]. We settle in the affirmative the open ques- 
tion they raise concerning the existence of a deterministic approximate counting 
algorithm for this problem. For simplicity, we give algorithms for approximately 
counting paths and cycles. These results can be easily extended to the problem 
of approximately counting bounded treewidth subgraphs, combining the same 
approach with the method of [5]. The main new ingredient in our deterministic 



algorithms is the apphcation of balanced families of perfect hash functions- a 
combinatorial notion introduced here which, while simple, appears to be very 
useful. 

Balanced Families of Perfect Hash Functions. We say that a family of 
functions from [n] to [k] is a 5-balanced {n, fc)-family of perfect hash functions 
if for every S* C [n], \S\ = k, the number of functions that are 1-1 on S is 
between T/5 and ST for some constant T > 0. Balanced families of perfect 
hash functions are a natural generalization of the usual concept of perfect hash 
functions. To assist with our explicit constructions, wc define also the even more 
generalized notion of balanced splitters. (See section 2 for the definition. This is 
a generalization of an ordinary splitter defined in [15].) 

Our Results. The main focus of the paper is on explicit constructions of 
balanced families of perfect hash functions and their applications. First, we give 
non-constructive upper bounds on the size of different types of balanced split- 
ters. Then, we compare these bounds with those achieved by constructive al- 
gorithms. Our main result is an explicit construction, for every 1 < i5 < 2, 
of a 5-balanced (n, fc)-family of perfect hash functions of size 2'^('^'°s^°sfe)(^ _ 
•|^^-0(iogfe) logn. The running time of the procedure that provides the construc- 
tion is 20('='°s'°g'=)(5- l)-0(i°s'=)nlogn-|- (5- 

Constructions of balanced families of perfect hash functions can be applied 
to various counting problems in graphs. In particular, we describe deterministic 
algorithms that approximate the number of times that a small subgraph appears 
within a large graph. The approximation is always up to some m\iltiplic;ativc 
factor, that can be made arbitrarily close to 1. For any 1 < 6 <2, the number of 
simple paths of length A; — 1 in a graph G = {V, E) can be approximated up to 
a multiplicative factor of 5 in time 2«(*^i°gi°g*^)((^ - i)-O(iogfe) [£:[ iog|v^[ + (5 - 
\^-o{k/ log fc) ^ -j-jj^g number of simple cycles of size k can be approximated up to a 
multiplicative factor of 5 in time 20('='°e'°s'=)(5- log|V| + {5 - 

]^-)-0(fe/logfe)_ 

Techniques. We use probabilistic arguments in order to prove the existence 
of different types of small size balanced splitters (whose precise definition is given 
in the next section). To construct a balanced splitter, a natural randomized al- 
gorithm is to choose a large enough number of independent random functions. 
We show that in some cases, the method of conditional probabilities, when ap- 
plied on a proper choice of a potential function, can derandomize this process 
in an efficient way. Constructions of small probability spaces that admit fc-wise 
independent random variables are also a natural tool for achieving good split- 
ting properties. The use of error correcting codes is shown to be useful when we 
want to find a family of functions from [n] to [Z], where I is much bigger than 
/c^, such that for every S [n\, \S\ = k, almost all of the functions should be 
1-1 on S. Balanced splitters can be composed in different ways and our main 
construction is achieved by composing three types of splitters. Wc apply the ex- 
plicit constructions of balanced families of perfect hash functions together with 
the color-coding technique to get our approximate counting algorithms. 



2 Balanced Families of Perfect Hash Functions 



In this section we formally define the new notions of balanced families of perfect 
hash functions and balanced splitters. Here are a few basics first. Denote by [n] 
the set {1, . . . , n}. For any k, 1 < k < n, the family of fc-sizcd subsets of [n] is 
denoted by ('^'). We denote by k mod I the unique integer < r < Z for which 
k = ql + r, for some integer q. We now introduce the new notion of balanced 
families of perfect hash functions. 

Definition 1. Suppose that 1 < k < n and 5 > I. We say that a family of 
funetions from, [n] to [k] is a S-balanced {n,k)-family of perfeet hash functions 
if there exists a constant real number T > 0, such that for every S € the 
number of functions that are 1-1 on S, which we denote by inj{S), satisfies the 
relation T/S < inj{S) < 6T. 

The following definition generalizes both the last definition and the definition 

of a splitter from [15]. 

Definition 2. Suppose that 1 < k < n and 6 > 1, and let H be a family of 
functions from [n] to [I]. For a set S G (}^) we denote by split{S) the number of 

functions h E H thai split S into equal-sized parts h~^{j) f^S, j = 1, . . . ,1. In 
case I does not divide k we separate between two cases. If k < I, then split{S) 
is defined to be the number of functions that are 1-1 on S. Otherwise, k > I 
and we require the first k mod I parts to be of size \k/l~\ and the rem,aining 
parts to be of size lk/l\ . We say that H is a 6-balanced (n, k, l)-splitter if there 
exists a constant real number T > 0, such that for every S G ('^') we have 
T/5 < split{S) < ST. 

The definitions of balanced families of perfect hash functions and balanced 
splitters given above enable us to state the following easy composition lemmas. 

Lemma 1. For any k < I, let H be an explicit 6-balanced {n,k, I) -splitter of 
size N and let G be an explicit ^-balanced {I, k)-family of perfect hash functions 
of size M. We can use H and G to get an explicit 6j-balanced {n,k)-family of 

perfect hash functions of size NM . 

Proof. We compose every function of H with every function of G and get the 
needed result. □ 

Lemma 2. For any k > I, let H be an explicit S-balanced {n,k, I) -splitter of 

size N . For every j , j — 1, . . . , I, let Gj be an explicit jj-balanced (n, kj)-family 
of perfect hash functions of size Mj, where kj = \k/l~\ for every j < k mod I 
and kj = [k/l\ otherwise. We can use these constructions to get an explicit 
{SYl^j^i ^j)-balanced (n, k)-family of perfect hash functions of size NY]!^-^^ Mj. 

Proof. We divide the set [k] into I disjoint intervals /i, . . . , J;, where the size of 
Ij is kj for every j = 1, . . . , Z. We think of Gj as a family of functions from [n] to 
Ij. For every combination of h G H and gj G Gj, j = 1, . . . ,1, we create a new 
function that maps an element x G [n] to gh{x){x)- ^ 



3 Probabilistic Constructions 



We will use the following two claims: a variant of the ChernofF bound (c.f., e.g., 
[4]) and Robbins' formula [9] (a tight version of Stirling's formula). 

Claim. Let Y be the sum of mutually independent indicator random variables, 
fi = E[Y]. For all 1 < ^ < 2, 

Pr[^ < F < M > 1 - 2e-(*-i)'''/«. 



Claim. For every integer n > 1, 



Now wc state the results for (5-balanced (n, k, Z)-splitters of the three types: 

k = I, k < I and k > I. 

Theorem 1. For any 1 < 5 < 2, there exists a 5-balanced (n, k)-family of perfect 
hash functions of size (^i')f" )- 

Proof, (sketch) Set p = k\/k^ and M = r^^^^j^^jjT^l • We choose M independent 

random functions. For a specific set S e ('^ ), the expected number of functions 
that are 1-1 on S is exactly pM. By the Chernofl[ bound, the probability that 
for at least one set S G ('^'), the number of functions that are 1-1 on S will not 
be as needed is at most 



2g-(5-irpM/8 < 2 je-C^i^^+i) < 1. 
kj ~ \k, 



□ 



Theorem 2. For any k < I and 1 < S < 2, there exists a 6-balanced {n,k,l)- 
splitter of size 0(2^^^^^^). 

Proof, (sketch) We set p = (^i_''lyik and M = [^^^^jji^l • Using Robbins' for- 
mula, wc get 



- < e''+^/^^(l - _)'-'=+V2 < gft+i/i2g-f (;-fc+i/2) 
p ~ I ~ 



+ 1/12 



We choose M independent random functions and proceed as in the proof of 
Theorem 1. □ 

For the case k > I, the probabilistic arguments from [15] can be generalized 
to prove existence of balanced (n, k, Z)-splitters. Here we focus on the special 
case of balanced (n, k, 2)-splitters, which will be of interest later. 



Theorem 3. For any k >2 and 1 < i5 < 2, there exists a 6-balanced (n,k,2)- 

'c^/H log n 
(5-1)^ 



splitter of size 0(^#tp^). 



Proof, (sketch) Set M = [^g^^ 

] , where p denotes the probabiHty to get the 
needed spHt in a random function. If follows easily from Robbins' formula that 
l/p = 0{Vk). We choose M independent random functions and proceed as in 
the proof of Theorem 1. □ 



4 Explicit Constructions 

In this paper, we use the term explicit construction for an algorithm that lists all 
the elements of the required family of functions in time which is polynomial in 

the total size of the fimctions. For a discussion on other definitions for this term, 
the reader is referred to [15]. We state our results for (5-balanced (n, k, ^)-splitters 
of the three types: k = I, k < I and k > I. 

Theorem 4. For any 1 < 6 < 2, a 5-balanced {n,k)-family of perfect hash 
functions of size (^i"!" ) can he constructed deterministically within time 

\k) (<5-l)^ • 

Proof We set p = kl/k'' and M = [i^^^^^fp^]. Denote X = {6 - l)/4, so 
obviously < A < 1/4. Consider a choice of M independent random functions 
from [n] to [k] . This choice will be derandomized in the course of the algorithm. 
For every S € ('^^)) we define Xs = J2i=i-^s,i, where Xs,i is the indicator 
random variable that is equal to 1 iff the ith function is 1-1 on S. Consider the 
following potential function: 

se(H) 

Its expectation can be calculated as follows: 

s M M 

^ (p-\pM 

.k 



1=1 i=l 



= (^^)(e-^f^[pe^ + (1 -p)]^ + e^f^[pe-^ + (1 -p)]^). 

We now give an upper bound for E[<P]. Since 1 + w < e" for all u and e~" < 
l-u + uV2 for allu > 0, we get that pe"^ + (1 -p) < ef(^"^-i) < eP^'^+^^/'^h 
Define c = - 1, that is A = ln(l + e). Thus pe^ + {1 - p) = 1 + ep < e^P. This 
implies that 



+ e 

Since e" < 1 + « + for all < u < 1, we have that ^ = 6*=^"^"-^ < e^^ . We 
conclude that 



We now describe a deterministic algorithm for finding M functions, so that 
E[$\ will still obey the last upper bound. This is performed using the method of 
conditional probabilities (c.f., e.g., [4], chapter 15). The algorithm will have M 
phases, where each phase will consist of n steps. In step i of phase j the algorithm 
will determine the ith value of the jth function. Out of the k possible values, 
we greedily choose the value that will decrease as much as possible. We 
note that at any specific step of the algorithm, the exact value of the conditional 
expectation of the potential function can be easily computed in time {^k'~'^^\ 

After all the M functions have been determined, every set S € ('^') satisfies 
the following: 

gA(Xs-pM) _|_gA(pM-Xs) < g2(feliin+l)_ 

This implies that 

-2(fclnn+ 1) < \{Xs-vM) < 2(fclnn + 1). 
Recall that X = {S — and therefore 

8(fclnn + l), ,^ 8(fclnTi + l), ,^ 

Plugging in the values of M and p we get that 

(1 - ^)pM <Xs<{l+ ^-^)pM. 

Using the fact that 1/u < 1 — {u — l)/2 for all 1 < u < 2, we get the desired 
result 

pM/S <Xs< 6pM. 

□ 

Theorem 5. For any I < S < 2, a S-balanced {n,k,\^^])- splitter of size 
(j-i)o(i) can be constructed m time ^j_]^jo(T) • 



Proof. Denote q = ■ Consider an explicit construction of an error correcting 

code with n codewords over alphabet [q] whose normalized Hamming distance is 
at least 1 ^ |- Such explicit codes of length 0{q'^ logn) exist [1]. Now let every 
index of the code corresponds to a function from [n] to [q]. If we denote by M 
the length of the code, which is in fact the size of the splitter, then for every 
5 e ('^'), the number of good splits is at least 

where the last inequality follows from the fact that 1 — (u — l)/2 > 1/m for all 
1< M < 2. □ 



For our next construction we use small probability spaces that support a 
sequence of almost fc-size independent random variables. A sequence Xi, . . . , 
of random Boolean variables is (e, A;)-independent if for any k positions ii < 
■ ■■ <ik and any k bits ai, . . . , a/c we have 

|Pr[Xi, =ai,...,Xi, =Qfc]-2-'=| <e. 

It is known ([14], [2], [1]) that sample spaces of size 2*^'^'^+'°8^ « Hogn that sup- 
port n random variables that are (e, fc)-independent can be constructed in time 
20(fe+iogi)^lQg^^ 

Theorem 6. For any k > I and 1 < 6 <2, a d-balanced {n, k, l)-splitter of size 
20(fciogi-iog(5-i))iQg^ can be constructed in time 20('=^°e'-'°g('^-i))nlogn. 

Proof. We use an explicit probability space of size 2'^'^'^'°s'~'°s(*~^') logn that 
supports n[log2/] random variables that are (e, fc[log2 ^] )-independent where 
e = 2~'^r'°S2 — 1). We attach [log2 /] random variables to each element of 

[n], thereby assigning it a value from [2^^°S2n]_ in case I is not a power of 2, all 
elements of [2 r'°S2 '1 ] _ [/] be mapped to [I] by some arbitrary fixed fmiction. 
If follows from the construction that there exists a constant T > so that for 
every S G ('^'), the number of good splits satisfies 

y < (1 - < splitiS) < (1 + < ST. 

□ 

Corollary 1. For any fixed c > 0, a (1 + c~'^)- balanced {n, k, 2)-splitter of size 
20(fc) logn can be constructed in time 2'^('^'nlogn. 

Setting I = k in Theorem 6, wc get that a (5-balanced (n, fe)-family of per- 
fect hash functions of size 2'^('^'°s'^~'°s(<5-i)) jQg (j^^ ^e constructed in time 
20(/ciogfc-iog((5-i))^ jQg^_ Note that if k is small enough with respect to n, say 
k = 0(logn/ log logn), then for any fixed 1 < 6 < 2, this already gives a family 
of functions of size polynomial in n. Wc improve upon this last result in the 
following Theorem, which is our main construction. 

Theorem 7. For 1 < S < 2, a S-balanced {n,k)-family of perfect hash func- 
tions of size (g_^-)0(iog k) iogn can be constructed m time (j_x)o(iogfc) niogn-l- [d — 
lyoik/iogk)^ /n particular, for any fixed l<d<2, the size is 20('= '°g i°s logn 
and the time is 2<^('='°e'°s'=)nlogn. 

Proof (sketch) Denote / = riog2 k] ,6' = 6^/^, 6" = (J^/f^O, and q = Let 
Hhea ^'-balanced {q, k, 0-splitter of size 2'='('= log '"g fe) (^' - i)-O(i) constructed 
using Theorem 6. For every j, j = let Bj be a ^''-balanced (q.kj)- 

family of perfect hash functions of size 0{e''^ ^°^''k){S" — l)~'^(i) constructed 
using Theorem 4, where kj = [fc/Z] for every j < k mod I and kj = [k/l\ 



otherwise. Using Lemma 2 for composing H and {Bj}^^^, we get a (5'^-balanced 
(g, fc)-family D' of perfect hash functions. 

Now let D" be a (^'-balanced (n, /c, g)-spHtter of size k^'^^\6' - ly^'^^hogn 
constructed using Theorem 5. Using Lemma 1 for composing D' and D" , we 
get a 5-balanced (n, fc)-family of perfect hash functions, as needed. Note that for 
calculating the size of each Bj, we use the fact that e^^^ < 1 + u < e" for all 
< u < 1, and get the following: 

S"-l = {l + {d- l))w - 1 > - 1 > 

The time needed to construct each Bj is 2^('=)((5'- The 2°('=) term 

is omitted in the final result, as it is negligible in respect to the other terms. 

□ 



5 Approximate Counting of Paths and Cycles 

We now state what it means for an algorithm to approximate a counting problem. 

Definition 3. We say that an algorithms approximates a eounting problem, by 
a multiplicative factor 6 > 1 if for every input x, the output ALG{x) of the 
algorithm satisfies N(x)/5 < ALG{x) < 6N{x), where N{x) is the exact output 
of the counting problem for input x. 

The technique of color-coding is used for approximate counting of paths and 
cycles. Let G = {V, E) be a directed or undirected graph. In our algorithms 
we will use constructions of balanced (|F|, fc)-families of perfect hash functions. 
Each such function defines a coloring of the vertices of the graph. A path is said 

to be colorful if each vertex on it is colored by a distinct color. Our goal is to 
count the exact number of colorful paths in each of these colorings. 

Theorem 8. For any \ < 5 < 2, the number of simple (directed or undirected) 
paths of length k — 1 in a (directed or undirected) graph G = {V,E) can be 
approximated up to a multiplicative factor of 6 in time ^^^^)o(tog |-^| log |^| + 
(S - i)-0(fc/iogfe). 

Proof, (sketch) We use the J-balanced fc)-family of perfect hash functions 
constructed using Theorem 7. Each function of the family defines a coloring of 
the vertices in k colors. We know that there exists a constant T > 0, so that 
for each set 5 C y of A; vertices, the number of functions that are 1-1 on S is 
between T/ S and ST. The exact value of T can be easily calculated in all of our 
explicit constructions. 

For each coloring, we use a dynamic programming approach in order to cal- 
culate the exact number of colorful paths. We do this in k phases. In the ith 
phase, for each vertex v £ V and for each subset C C {1, . . . , fc} of « colors, we 
calculate the number of colorful paths of length i — 1 that end at v and use the 



colors of C. To do so, for every edge (u, v) G E, we check whether it can be the 
last edge of a colorful path of length i—1 ending at either uovv. Its contribution 
to the number of paths of length i — 1 is calculated using our knowledge on the 
number of paths of length i — 2. The initialization of phase 1 is easy and after 
performing phase k we know the exact number of paths of length k—1 that end 
at each vertex v €:V. The time to process each coloring is therefore 2'^('^)|£'|. 

We sum the results over all colorings and all ending vertices v €: V . The 
result is divided by T. In case the graph is undirected ,we further divide by 2. 
This is guaranteed to be the needed approximation. □ 

Theorem 9. For any 1 < 8 < 2, the mim,ber of simple (directed or undirected) 
cycles of size k in a (directed or undirected) graph G = {V,E) can be approxi- 
mated up to a multiplicative factor of S in time jgz:Y)i{i^kj\E\\V\ log \V\ + {S — 
iyO{k/iosk)_ 

Proof, (sketch) We use the (5-balanced (|y|, fc)-family of perfect hash functions 
constructed using Theorem 7. For every set S of k vertices, the number of func- 
tions that are 1-1 on S is between T/5 and ST. Every function defines a coloring 
and for each such coloring we proceed as follows. For every vertex s G V we run 
the algorithm described in the proof of Theorem 8 in order to calculate for each 
vertex v € V the exact number of colorful paths of length k — 1 from s to v. 
In case there is an edge {v, s) that completes a cycle, we add the result to our 
count. 

We sum the results over all the colorings and all pairs of vertices s and v as 
described above. The result is divided by kT. In case the graph is undirected, 
we further divide by 2. The needed approximation is achieved. □ 

Corollary 2. For any constant c> 0, there is a deterministic polynomial time 
algorithm for approximating both the number of simple paths of length k and 
the number of simple cycles of size k for every k < 0( . 1°^^^^ ) in a graph 
with n vertices, where the approximation is up to a multiplicative factor of 1 + 
(Inlnn)-^'"'"". 

6 Concluding Remarks 

— An interesting open problem is whether for every fixed ^ > 1, there ex- 
ists an explicit 5-balanced (n, fc)-family of perfect hash functions of size 
20(fe) logn. The key ingredient needed is an improved construction of bal- 
anced (n, k, 2)-splitters. Such splitters can be applied successively to get the 
balanced (n, k, \\0g2 fc] )-splitter needed in Theorem 7. It seems that the con- 
structions presented in [2] could be good candidates for balanced (n, k, 2)- 
splitters, although the Fourier analysis in this case (along the lines of [8]) 
seems elusive. 

— Other algorithms from [5] can be generalized to deal with counting problems. 
In particular it is possible to combine our approach here with the ideas of 



[5] based on fast matrix multiplication in order to approximate the num- 
ber of cycles of a given length. Given a forest F on k vertices, the number 
of subgraphs of G isomorphic to F can be approximated using a recursive 
algorithm similar to the one in [5]. For a weighted graph, wc can approxi- 
mate;, for example, both the number of minimum (maximum) weight paths 
of length k—1 and the number of minimum (maximum) weight cycles of size 
k. Finally, all the results can be readily extended from paths and cycles to 
arbitrary small subgraphs of bounded tree-width. We omit the details. 
— In the definition of a balanced (n, fc)-family of perfect hash functions, there 
is some constant T > 0, such that for every S C [n], \S\ = k, the number 
of functions that are 1-1 on 5* is close to T. Wc note that the value of T 
need not be equal to the expected number of 1-1 functions on a set of size 
k, for the case that the functions were chosen independently according to 
a uniform distribution. For example, the value of T in the construction of 
Theorem 7 is not even asymptotically equal to what one would expect in a 
uniform distribution. 
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